AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Screen Content Reasoning

# Screen Content Reasoning

Ferret UI Llama8b
Ferret-UI is the first multimodal large language model (MLLM) focused on user interfaces, built on Llama-3-8B, capable of performing complex UI tasks such as referencing, localization, and reasoning.
Image-to-Text Transformers
F
jadechoghari
256
69
Ferret UI Gemma2b
Ferret-UI is the first multimodal large language model focused on user interfaces, built on Gemma-2B, specifically designed for UI referencing, localization, and reasoning tasks.
Image-to-Text Transformers
F
jadechoghari
302
50
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase